Search CORE

58 research outputs found

Error Graphs and the Reconstruction of Elements in Groups

Author: Levenshtein Vladimir
Siemons Johannes
Publication venue: 'Elsevier BV'
Publication date: 01/01/2009
Field of study

Packing and covering problems for metric spaces, and graphs in particular, are of essential interest in combinatorics and coding theory. They are formulated in terms of metric balls of vertices. We consider a new problem in graph theory which is also based on the consideration of metric balls of vertices, but which is distinct from the traditional packing and covering problems. This problem is motivated by applications in information transmission when redundancy of messages is not sufficient for their exact reconstruction, and applications in computational biology when one wishes to restore an evolutionary process. It can be defined as the reconstruction, or identification, of an unknown vertex in a given graph from a minimal number of vertices (erroneous or distorted patterns) in a metric ball of a given radius r around the unknown vertex. For this problem it is required to find minimum restrictions for such a reconstruction to be possible and also to find efficient reconstruction algorithms under such minimal restrictions. In this paper we define error graphs and investigate their basic properties. A particular class of error graphs occurs when the vertices of the graph are the elements of a group, and when the path metric is determined by a suitable set of group elements. These are the undirected Cayley graphs. Of particular interest is the transposition Cayley graph on the symmetric group which occurs in connection with the analysis of transpositional mutations in molecular biology. We obtain a complete solution of the above problems for the transposition Cayley graph on the symmetric group.Comment: Journal of Combinatorial Theory A 200

arXiv.org e-Print Archive

CiteSeerX

Elsevier - Publisher Connector

University of East Anglia digital repository

Reconstruction of permutations distorted by single transposition errors

Author: Konstantinova Elena
Levenshtein Vladimir
Siemons Johannes
Publication venue
Publication date: 01/01/2007
Field of study

The reconstruction problem for permutations on

n

elements from their erroneous patterns which are distorted by transpositions is presented in this paper. It is shown that for any

n \geq 3

an unknown permutation is uniquely reconstructible from 4 distinct permutations at transposition distance at most one from the unknown permutation. The {\it transposition distance} between two permutations is defined as the least number of transpositions needed to transform one into the other. The proposed approach is based on the investigation of structural properties of a corresponding Cayley graph. In the case of at most two transposition errors it is shown that

\frac32(n-2)(n+1)

erroneous patterns are required in order to reconstruct an unknown permutation. Similar results are obtained for two particular cases when permutations are distorted by given transpositions. These results confirm some bounds for regular graphs which are also presented in this paper.Comment: 5 pages, Report of paper presented at ISIT-200

arXiv.org e-Print Archive

University of East Anglia digital repository

Application of cover-free codes and combinatorial designs to two-stage testing

Author: Berger Toby
Levenshtein Vladimir I.
Publication venue: Elsevier Science B.V.
Publication date: 15/05/2003
Field of study

AbstractWe study combinatorial and probabilistic properties of cover-free codes and block designs which are useful for their efficient application as the first stage of two-stage group testing procedures. Particular attention is paid to these procedures because of their importance in such applications as monoclonal antibody generation and cDNA library screening

Elsevier - Publisher Connector

A simple proof of the main inequalities for parameters of codes in polynomial association schemes

Author: Levenshtein Vladimir
Publication venue: HAL CCSD
Publication date: 01/01/1994
Field of study

Résumé disponible dans le fichier PD

INRIA a CCSD electronic archive server

06201 Abstracts Collection -- Combinatorial and Algorithmic Foundations of Pattern and Association Discovery

Author: Ahlswede Rudolf
Apostolico Alberto
Levenshtein Vladimir I.
Publication venue: Dagstuhl Seminar Proceedings. 06201 - Combinatorial and Algorithmic Foundations of Pattern and Association Discovery
Publication date: 01/01/2006
Field of study

From 15.05.06 to 20.05.06, the Dagstuhl Seminar 06201 ``Combinatorial and Algorithmic Foundations of Pattern and Association Discovery\u27\u27 was held in the International Conference and Research Center (IBFI), Schloss Dagstuhl. During the seminar, several participants presented their current research, and ongoing work and open problems were discussed. Abstracts of the presentations given during the seminar as well as abstracts of seminar results and ideas are put together in this paper. The first section describes the seminar topics and goals in general. Links to extended abstracts or full papers are provided, if available

Dagstuhl Research Online Publication Server

Debugging Inputs

Author: Levenshtein Vladimir I.
Muşlu Kıvanç
Raman Vijayshankar
Scheffczyk Jan
Publication venue
Publication date: 01/06/2020
Field of study

When a program fails to process an input, it need not be the program code that is at fault. It can also be that the input data is faulty, for instance as result of data corruption. To get the data processed, one then has to debug the input data—that is, (1) identify which parts of the input data prevent processing, and (2) recover as much of the (valuable) input data as possible. In this paper, we present a general-purpose algorithm called ddmax that addresses these problems automatically. Through experiments, ddmax maximizes the subset of the input that can still be processed by the program, thus recovering and repairing as much data as possible; the difference between the original failing input and the “maximized” passing input includes all input fragments that could not be processed. To the best of our knowledge, ddmax is the first approach that fixes faults in the input data without requiring program analysis. In our evaluation, ddmax repaired about 69% of input files and recovered about 78% of data within one minute per input

CISPA – Helmholtz-Zentrum für Informationssicherheit

Crossref

Open Repository and Bibliography - Luxembourg

Automated Implementation of Windows-related Security-Configuration Guides

Author: Adam
Bird Steven
Birkholz Henk
Levenshtein Vladimir I
Raab Markus
Stöckle Patrick
Xu Tianyin
Yang J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 19/09/2022
Field of study

Hardening is the process of configuring IT systems to ensure the security of the systems' components and data they process or store. The complexity of contemporary IT infrastructures, however, renders manual security hardening and maintenance a daunting task. In many organizations, security-configuration guides expressed in the SCAP (Security Content Automation Protocol) are used as a basis for hardening, but these guides by themselves provide no means for automatically implementing the required configurations. In this paper, we propose an approach to automatically extract the relevant information from publicly available security-configuration guides for Windows operating systems using natural language processing. In a second step, the extracted information is verified using the information of available settings stored in the Windows Administrative Template files, in which the majority of Windows configuration settings is defined. We show that our implementation of this approach can extract and implement 83% of the rules without any manual effort and 96% with minimal manual effort. Furthermore, we conduct a study with 12 state-of-the-art guides consisting of 2014 rules with automatic checks and show that our tooling can implement at least 97% of them correctly. We have thus significantly reduced the effort of securing systems based on existing security-configuration guides

arXiv.org e-Print Archive

Crossref

Improving OCR of historical newspapers and journals published in Finland

Author: Breuel Thomas M
Drobac Senka
Jauhiainen Tommi Sakari
Levenshtein Vladimir I.
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/01/2019
Field of study

Peer reviewe

Crossref

Helsingin yliopiston digitaalinen arkisto

Generate FAIR Literature Surveys with Scholarly Knowledge Graphs

Author: Araujo Samur
Bikakis Nikos
Gromann Dagmar
Iglesias Alejandro Rodrí
Jaradeh Mohamad Yaser
Jaradeh Mohamad Yaser
Kohl Christian
Levenshtein Vladimir I
Maillot Pierre
Oelen Allard
Petrova Alina
Randolph Justus J.
Vahdati Sahar
Webster Jane
Winkler William E.
Publication venue: New York City, NY : Association for Computing Machinery
Publication date: 01/01/2020
Field of study

Reviewing scientific literature is a cumbersome, time consuming but crucial activity in research. Leveraging a scholarly knowledge graph, we present a methodology and a system for comparing scholarly literature, in particular research contributions describing the addressed problem, utilized materials, employed methods and yielded results. The system can be used by researchers to quickly get familiar with existing work in a specific research domain (e.g., a concrete research question or hypothesis). Additionally, it can be used to publish literature surveys following the FAIR Data Principles. The methodology to create a research contribution comparison consists of multiple tasks, specifically: (a) finding similar contributions, (b) aligning contribution descriptions, (c) visualizing and finally (d) publishing the comparison. The methodology is implemented within the Open Research Knowledge Graph (ORKG), a scholarly infrastructure that enables researchers to collaboratively describe, find and compare research contributions. We evaluate the implementation using data extracted from published review articles. The evaluation also addresses the FAIRness of comparisons published with the ORKG

arXiv.org e-Print Archive

Crossref

Repositorium für Naturwissenschaften und Technik

Towards Effective Extraction and Linking of Software Mentions from User-Generated Support Tickets

Author: Aggarwal Vishalaksh
Bouma Gerlof
Han Jianglei
Jan Ea-Ee
Lafferty John
Lample Guillaume
Levenshtein Vladimir I
Mikolov Tomas
Potharaju Rahul
Putthividhya Duangmanee Pew
Rüd Stefan
Yang Shuo
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/10/2018
Field of study

Software support tickets contain short and noisy text from the customers. Software products are often represented by various surface forms and informal abbreviations. Automatically identifying software mentions from support tickets and determining the official names and versions are helpful for many downstream applications, \eg routing the support tickets to the right expert groups for support. In this work, we study the problem ofsoftware product name extraction andlinking from support tickets. We first annotate and analyze sampled tickets to understand the language patterns. Next, we design features using local, contextual, and external information sources, for extraction and linking models. In experiments, we show that linear models with the proposed features are able to deliver better and more consistent results, compared with the state-of-the-art baseline models, even on dataset with sparse labels

Crossref

UCL Discovery